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Abstract 

We study a random graph model which combines properties of the edge percolation 
model on Z d and a classical random graph G(n,c/n). We show that this model, being 
a homogeneous random graph, has a natural relation to the so-called "rank 1 case" of 
inhomogeneous random graphs. This allows us to use the newly developed theory of 
inhomogeneous random graphs to describe completely the phase diagram in the case 
d = 1. The phase transition is similar to the classical random graph, it is of the second 
order. We also find the scaled size of the largest connected component above the phase 
transition. 



1 Introduction. 

We consider a graph on the set of vertices V$ := {1, . . . , N} d in Z d , where the edges between 
any two different vertices i and j are presented independently with probabilities 



Qi if \i — j\ = 1 

Plj ~ i c/N d , if \i- j\ > 1 



where < q < 1 and < c < N are constants. This graph, call it G%(q, c) is a mixture of 
percolation model, where each pair of neighbours in Z d is connected with probability q, and 
a random graph model, where each vertex is connected to any other vertex with probability 
c/|V#|. 

The introduced model is a simplification of the most common graphs designed to study 
natural phenomena, in particular, biological neural networks [H]. Observe the difference 
between this and the so-called " small- world" models intensively studied after [9,J. In the 
" small- world" models where edges from the grid may be kept or removed, only finite number 
(often at most 2d) of the long-range edges may come out of each vertex, and the probability 
of those is a fixed number. 

We are interested in the limiting behaviour of the introduced graph G%(q, c) as N — ► oo. 
One can consider this model as a graph on Z d or on a torus, in the limit the results are the 
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same. The one-dimensional case which we study here, is exactly solvable. We shall write 
G l N (q,c) = G N (q,c). 

Let X be a random variable with Fs(l — g)-distribution, i.e., 

P{X = k} = (l-q)q k - 1 , A; = 1,2,... , (1.1) 

with 

EX ' 



l-q 

Let further C\ {g \ denote the size of the largest connected component in a graph G 
Theorem 1.1. For any < q < 1 define 

EX l-q 
EX 2 ~ l + q 

i) If c < c cr (q) then there exists a constant a = a(q,c) > , lo * , such that 



^fo) = ^2 = TTZ ■ (1-2) 



and for any a x < yr\ 



|logq| 



?{c x (G N {q iC j) >«logiv}^0, (1.3) 



{c^GWfac)) <a 1 logiv} -.0, (1.4) 



as N — > oo. 

^ If c> c cr (q) then 



C x (G N {q,c) 



*P (1.5) 



as N — > oo ; wift (3 = (3(q, c) defined as the maximal solution to 

P=l-^E{Xe- cX ^}. (1.6) 

Observe the following duality of this result. For any c < 1 we know that the subgraph 
induced in our model by the long-range edges may have at most O(logiV) vertices in a 
connected component. According to Theorem II .1| for any c < 1 there is 

™ = T~Tc 
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such that for all q cr (c) < q < 1 our model will have a giant component with a size of order 
N, while any q < q cr (c) is insufficient to produce a giant component in G^(q,c) . Hence, 
Theorem 11.11 mav also tell us something about the "distances" between the components of 
a random graph when it is considered on the vertices of Z. 

Remark 1.1. In the proof of Al.ty) we will show how to obtain a(q, c), and will discuss how 
optimal this value is. Statement \1.4\) is rather trivial (and far from being optimal): it follows 
from a simple observation (see the details below) that 

pjc^G^c)) < ^logiv}<p|c 1 (G' i v( g ,0))< J^^ N }^°- 

Remark 1.2. For any fixed c function /3(q,c) is continuous atq = 0: ifq — 0, i.e., when our 
graph is merely a classical G„, c / n random graph, then X = 1 and M.n}) becomes a well-known 
relation. Equation M.b]) can be written in an exact form: 

(3 = 1 o(l-g) 2 . 



It is easy to check that if c < c cr then the equation (jl.6|) does not have a strictly positive 
solution, while (3 = is always a solution to (|1.6|) . Therefore one can derive 

^ cUicCr ~ (EX) 2 E(X3)" g 2 + 4g + r [L7) 

This shows that the emergence of the giant component at critical parameter c = c cr becomes 
slower as q increases, but the phase transition remains of the second order (exponent 1) for 
any q < 1. 

We conjecture that similar results hold in the higher dimensions if q < Q cr (d), where 
Q cr (d) is the percolation threshold in the dimension d. More exactly, Theorem 11.11 fas well 
as the first equality in (|1.7j) ) should hold with X replaced by another random variable, 
which is stochastically not larger than the size of the open cluster at the origin in the edge 
percolation model with a probability of edge q. It is known from the percolation theory 
(see, e.g., that the tail of the distribution of the size of an open cluster in the subcritical 
phase decays exponentially. This should make possible to extend our arguments (where we 
use essentially the distribution of X) to the general case. 

Our result in the supercritical case, namely equation (jl.6j) looks somewhat similar to the 
equation obtained in [2] for the "volume" (the sum of degrees of the involved vertices) of the 
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giant component in the graph with a given sequence of the expected degrees. Note, however, 
that the model in (as well as the derivations of the results) differs essentially from the one 
studied here. In particular, in our model the critical mean degree when c = c cr and N — ► oo 
is given by 

2q + c cr = 2q + ^- = l + ^- (1.8) 
1 + q 1 + q 

which is strictly greater than 1 for all positive q. This is in a contrast with the model studied 
in 0, where the critical expected average degree is still 1 as in the classical random graph. 

Although our model (when considered on the ring or torus in higher dimensions) is a 
perfectly homogeneous random graph, in the sense that the degree distribution is the same 
for any vertex, we study it via inhomogeneous random graphs, making use of the recently 
developed theory from 1J. The idea is the following. First, we consider the subgraph 
induced by the short-range edges, i.e., the edges which connect two neighbouring nodes with 
probability q. It is composed of the consecutive connected paths (which may consist just 
of one single vertex) on Vjy = {1, . . . , N}. Call a macro-vertex each of the component of 
this subgraph. We say that a macro-vertex is of type k, if k is the number of vertices in 
it. Conditionally on the set of macro- vertices, we consider a graph on these macro- vertices 
induced by the long-range connections. Two macro- vert ices are said to be connected if there 
is at least one (long-range type) edge between two vertices belonging to different macro- 
vertices. Thus the probability of an edge between two macro-vertices Vi and Vj of types x 
and y correspondingly, is 

p xy (N) := 1 - (l - ^-Y y . (1.9) 



Below we argue that this model fits the conditions of a general inhomogeneous graph model 
defined in |Tj, find the critical parameters and characteristics for the graph on macro- vertices, 
and then we turn back to the original model. We use essentially the results from pQ to derive 
(II. 6|) . while in the subcritical case our approach somewhat differs from the one in pQ; we 
discuss this in the end of Section 2.4. We shall also note that our graph on macro-vertices 
is similar to the model studied in 0, and our results on the critical value agree with those 
in 0. 

Finally we comment that our result should help to study more general model for the 
propagation of the neuronal activity introduced in jHj. Here we show that a giant component 
in the graph can emerge from two sources, none of which can be neglected, but each of which 
may be in the subcritical phase, i.e., even when both q < 1 and c < 1. In particular, for 
any < c < 1 we can find q < 1 which allows with a positive probability the propagation of 
impulses through the large part of the network due to the local activity. 
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2 Proof 



2.1 Random graph on macro- vert ices. 

Denote X a random number of the vertices connected through short-range edges to the 
vertex 1 on = {1,2, . . .}. Clearly, X has the First success distribution defined in (jl.lj) . 
Let Xi, X 2 , . . . , be independent copies of X, and define for any N > 1 

n n+1 

T(N) := min{n > : ^ Xi < N, ^ X, > N}, 

1=1 i=l 

where we assume that a sum over an empty set equals zero. 

Consider now the subgraph on Vn = {1, . . . , N} induced by the short-range edges. This 
means that any two vertices i and i+1 from Vn are connected with probability q independent 
of the rest. By the construction this subgraph, call it G N \q), is composed of a random 
number of connected paths of random sizes. We call here the size of a path the number of its 
vertices. Clearly, there is a probability space (O, JF, P) where the number of paths in G N \q) 
equals T(N) if ^fjf } X t = N, or T(N) + 1 if ^ZT } X i < N - Correspondingly, the sizes of 
the paths follow the distribution of 

T(N) 

X=(X 1 ,X 2 ,...,X T{N) ,N-J2x i ) (2.1) 

i=i 

(where the last entry may take zero value). 

On the other hand, the number of the connected components of G N \q) exceeds exactly 
by one the number of "missed" short edges on Vn- This means that on the same probability 
space (Q, J 7 , P) there is a random variable Yn distributed as Bin(N — 1, 1 — q), such that 
either T(N) — Yn + 1 or T(N) + 1 = Yn + 1, and in any case 

0<T(N)-Y N <1. (2.2) 
This together with the Strong Law of Large Numbers implies 
Proposition 2.1. 

T{N) a . s . 1 

1 - q 



N 5 EX 

as N — ► oo. □ 
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Also the relation (|2.2|) allows us to use the large deviation inequality from j3] (formula 
(2.9), p. 27 in for the binomial random variables in order to obtain the following rate of 
convergence 

T(N) 



N EX 



>5}<2ex P [- T ^-— ) N) (2.3) 



for all 5 > and N > 2/5. 

Define for any k > 1 an indicator function 



h(x) 



1, if x = k, 
0, otherwise. 



As an immediate corollary of Proposition 12. II and the Law of Large Numbers we also get the 
following result. 

Proposition 2.2. For any fixed k > 1 

T(N) 

— £ I k (Xi) $ P{X = k} = (1 - g^ 1 =: /i(/c) (2.4) 

as A — > oo. □ 

Given a vector of paths X defined in (12. lj) . we introduce another graph Gjv(X, g, c) as 
follows. The set of vertices of GV(X, q, c) we denote {vi, . . . , Ut(at)}- Each vertex Vi is said 
to be of type A,, which means that vi corresponds to the set of A, connected vertices on Vjv. 
We shall also call any vertex V{ of Gat(X, q, c) a macro-vertex, and write 

f {l,.-.,^}, ifi = l; 

Vi = { (2.5) 

I {ejs a, + i, . . . , e;=1 a, + a,}, if < > i. 

With this notation the type of a vertex t>j is simply the cardinality of set v^. The space of 
the types of macro-vertices is S = {1, 2, . . .}. According to ()2.4|) the distribution of type 
of a (macro-)vertex in graph Gjv(X, g, c) converges to measure /x on 5. The edges between 
the vertices of Gjv(X, g, c) are presented independently with probabilities induced by the 
original graph Giv(g, c). More precisely, the probability of an edge between any two vertices 
Vi and Vj of types x and y correspondingly, is p xy (N) introduced in (jl.9|) . Clearly, this 
construction provides a one-to-one correspondence between the connected components in 
the graphs Gjv(X, q, c) and G^{q, c): the number of the connected components is the same 
for both graphs, as well as the number of the involved vertices from in two corresponding 
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components. In fact, considering conditionally on X graph Gn(X, q, c) we neglect only those 
long-range edges from G^(q, c), which connect vertices within each Vi, i.e., the vertices which 
are already connected through the short-range edges. 
Consider now 

cyy K' N (x,y) 



P„(N) = l-[l--) =:^^. (2.6) 

Observe that if x(N) — > x and y(N) — > y then 

K' N (x(N),y(N))^cxy (2.7) 

for all x,y G S. In order to place our model into the framework of the inhomogeneous 
random graphs from pQ let us introduce another (random) kernel 

T(N) 

K T (N)(x,y) = -^—K N (x,y), 

so that we can rewrite the probability p xy (N) in a graph Gn(X., q, c) taking into account the 
size of the graph: 

(We use notations from pQ whenever it is appropriate.) According to Proposition 12.11 and 
(127711 . if x(N) -> x and y(N) -> y then 

K T{N - ) (x(N),y(N)) -> n(x,y) := |T^2/ a - s - ( 2 -9) 

as iV — > oo for all a;, y G 5 1 . 

Hence, in view of Proposition 12.21 we conclude that conditionally on T(N) = t(N), where 
t(N)/N — > 1/EJf, our model falls into the so-called "rank 1 case" of the general inhomoge- 
neous random graph model G v (t(N), Kttm) with a vertex space V = (S, fi, (Xi, . . . , ^(jv))jv>i) 
from (1 3 (Chapter 16.4). Furthermore, it is not difficult to verify with a help of the Proposi- 
tions O and ET21 that 

kg L l (S x S,fix n), (2.10) 



since 

oo oo 



y=l x=l 

and for any t(N) such that t(N)/N -> 1/EX 



— -E{e(G N (X,q,c))\T(N) = t(N)} -> - £ £ «(x, y)(*(x)(A(y) , (2.11) 



where e(G) denotes the number of edges in a graph G. According to Definition 2.7 from PQ, 
under the conditions (|2.11j) . (|2.1U|) and (|2.9|) the sequence of kernels Kt(N) (on the countable 
space S x S) is called graphical on V with limit k. 

2.2 A branching process related to Gjv(X, q, c). 

Here we closely follow the approach from pQ . We shall use a well-known technique of branch- 
ing processes to reveal the connected component in graph Gjv(X, q, c). Recall first the usual 
algorithm of finding a connected component. Conditionally on the set of macro- vertices, 
take any vertex Vi to be the root. Find all the vertices {vj , i>K v^} connected to this 
vertex V{ in the graph Gtv(X, q, c), call them the first generation of Vi, and then mark Vi as 
"saturated". Then for each non-saturated but already revealed vertex, we find all the ver- 
tices connected to them but which have not been used previously. We continue this process 
until we end up with a tree of saturated vertices. 

Denote t n (x) the set of the macro- vertices in the tree constructed according to the above 
algorithm with the root at a vertex of type x. 

It is plausible to think (and in our case it is correct, as will be seen below) that this 
algorithm with a high probability as iV — > oo reveals a tree of the offspring of the following 
multi-type Galton- Watson process with type space S — {1,2, . . .}: at any step, a particle 
of type x G S is replaced in the next generation by a set of particles where the number 
of particles of type y has a Poisson distribution Po(k(x, y)p,(y)). Let p(k;x) denote the 
probability that a particle of type x produces an infinite population. 

Proposition 2.3. The function p(k;x), x G S, is the maximum solution to 

P (k;x) = 1 - e -££i*(*.¥MvM«;i/). (2.12) 

Proof. We have 

oo 

y=l % 

which together with (J2.1(Jj) verifies that the conditions of Theorem 6.1 from pQ are satisfied, 
and the result ()2.12|) follows by this theorem. □ 

Notice that it also follows by the same Theorem 6.1 from pQ that p(n; x) > for all x G S 
if and only if 

c ^ 2 EX 2 l+q 

y=l 
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otherwise, p(n; x) = for all x G S. Hence, the formula (|1.2j) for the critical value follows 
from (|XT3|) . 

As we showed above, conditionally on T(N) (so that T(N)/N — > 1/EX) the sequence 
kt(n) is graphical on V. Hence, the conditions of Theorem 3.1 from pQ are satisfied and we 
derive (first, conditionally on T(N), and therefore unconditionally) that 

d^X^c)) P 
T[N) >M ' 

where p(n) = Yl™=i p( K '-> x)p(x). This together with Proposition 12. II on the a.s. convergence 
of T(N) implies 

C 1 (G N (X, g ,c)) I>{1 _ q)p{K) (214) 

Notice that here Ci(GV(X, q, c)) is the number of macro- vertices in G^(K, q, c). 

2.3 On the distribution of types of vertices in Grj\r(X, q, c). 
Given a vector of paths X (see (J2.1|0 we define a random sequence 

where 

T(N) 

A4=A4(X)= J^IkiXi). 

1=1 

In words, A4 is the number of (macro-)vertices of type k in the set of vertices of graph 
Gtv(X, q, c). We shall prove here a useful result on the distribution of Af (which is stronger 
than Proposition 12. 2|) . 

Lemma 2.1. For any fixed e > 

P{\N k /T(N) -fi(k)\ > ekp(k) for some 1 < k < N} = o(l) (2.15) 

as N — » oo. 

Proof. Let us fix e > arbitrarily. Observe that for any K > 1/e 

P{\AT k /T(N) -/i(k)\ > ek/i(k) for some 1 < k < N} (2.16) 
< P{ max Xi > K] 

l<i<T(N) 



+P{\Af k /T(N) - fi(k)\ >ek^(k) for some 1 < k < K}. 

Next we shall choose an appropriate K = K(N) so that we will be able to bound from above 
by o(l) (as N — ► 00) each of the summands on the right in (j2.16|) . 
Let us fix 5 > arbitrarily, and define an event 

T(N) 1 



A 



5,N 



N EX 



< 6 



Recall that according to (|2.3j) 

P(A 5 ,n) > 1 - 2exp 
as N — > 00. Now we derive 



12(1 -q) 



N = l-o(l) 



P{ max Xi > K\ < P{ max X { > K I As tv}P(A 5 at) + P{A S N } 

l<i<T(N) l<i<T(N) ' ' 



1 



(2.17) 



(2.18) 



(2.19) 



EX 

< 



< — + 6) NPiX, > K I A s ,n}P(As,n) + P{A 5 , N } 



EX 



+ 5 )NP{X 1 >K} + P{As, N } 



as N — > cxd. Making use of the formula (jl.l|) for the distribution of Xi we obtain from (|2.19j) 
and (l2~T%J) 

P{ max >K}< CNq K + 2 exp 

l<i<T(jV) 



N 



(2.20) 



12(1 -g) 

as X — > cxd, where C = C(5, g) is some finite positive constant. Let now ui(N) < N be any 
function tending to infinity with N, and set 

1 



log? I 



logJV + wi(JV). 



Clearly, bound (|2.2(J|) with K replaced by K(N) implies 

P{ max Xi > K(N)\ = o(l) 

as N — >• 00. 

Now we consider the last term in (J2.16|) . Let us define 



(2.21) 



(2.22) 



kn := max 



logg| 



+ 2. 



(2.23) 
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Then we obtain making use of (J2.18)) 

P{\Af k /T(N) - fi(k)\ >ek/j(k) for some 1 < k < K(N)} 

k i T(N) 



(2.24) 



^ |j P{l T^V) E 4 ™ " MA:)I > 



K(AT) 



T(N) 



+ E p {l^nE^ra-M^)l>^/i(^)l ^}P(Ajv) + o(i) 



*0 



T(N) 



E P {l^nE AW -/*(*) I > efcM*) I Aiv}P(Aiv) 

fe=fc + l ^ ' 1=1 

as iV — > oo, where the last equality is due to Proposition 12.21 Notice that for each k > ko 
we have e k > 1 and therefore 



T(AO 

T(iV)E J ^)-M^) 



> £ A;/i(A;) | A, 



■S,N 



(2.25) 



< P 



r i t(jv) "I 

[(lTx+< 5 H+ 1 

^ 4(Xi) > (i(k) + e kfi(k) | A 5 ,n } =■ P(fc). 



«=i 



Set £(iV) = [(^ + 5) N] + 1. Then using the bound 



t{N) >1 ~2 EXS 



for all N > 2/8, we derive 



r 1 t(^) 

P(*) < P < ^X>M > A*(*)(l + e*)(l 



5EX 



5)\A, 



-S,N 



(2.26) 
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for all N >2/5. Now for all k > k and < 8 < we have (1 + e k)(l - > 1 + § k, 
and therefore 

f 1 t(N) ) 

P(k) < P < E > + l k ) I Aiv ^ (2.27) 



< P 



( 1 t(iv) ] 



Note that Ik(Xi) follows the binomial distribution Bin(t(N), fi(k)). This allows us to 

use the large deviation inequality from jl] (see (2.5), p. 26 in jl]) and derive 



1 t(N) ) 

T) J2h(X l )>fi(k)(l + -k)\ (2.28) 



< 



exp 



t(N) 
{lkn(k)t(N)f 



-) < exp ( - l-ekfi{k)t{N)) 



\ek^{k)t{N) + 2n(k)t(N) J ~ ^ K 10 
for all k > k . Substituting this into (|2.27|) we obtain 

P(k) < exp ( - ±- £ k^{k)t{N)) / P{A S , N ) (2.29) 

for all k > k . The last bound combined with ()2.25|) and ()2.24|) leads to 

P{\Af k /T(N) - fi(k)\ >ek/j(k) for some 1 < k < K(N)} 

K(N) 

< Bxp(-—ekfi(k)t{N)) +o(l), 

k=k +l 

as N ^ oo. Taking into account that function kfi(k) is decreasing for k > ko we derive from 
the last bound: 

P{\AT k /T(N) - n(k)\ >ekfi(k) for some 1 < k < K(N)} (2.30) 
< exp ( - ±sK(N)n(K(N))t(N) + logK(N)) + o(l), 

as iV — > oo. 

Setting now ui(N) = log log log iV in (|2.21|) . it is easy to check that for 

K(N) = i , log N + log log log N 
\\ogq\ 

the entire right-hand side of the inequality (|2.30|) is o(l) as N — > oo. This together with the 
previous bound ()2.22j) and inequality (|2.1fi|) finishes the proof of lemma. □ 
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2.4 Proof of Theorem 11.11 in the subcritical case c < c cr (q) . 

Let us fix < q < 1 and then c < c cr (q) arbitrarily. Given X let again Vi denote the macro- 
vertices with types X iy i = 1,2,..., respectively, and let L denote a connected component 
in Gat(X, q, c). Firstly, for any K > and < 5 < 1/EX we derive with help of ()2.18j) 

p{c 1 (G JV (g,c)) < K} < pjc^GVM)) <ir} = p| i< max r) X i < k\ (2.31) 
<p{ max X t < K \ A 5N \ + o(l) < (1 - P {X > K}) N ^^ + o(l), 

l^l<i<T(Af) j 

as X — > oo, where X has the Fs(l — g)-distribution. Since 

P{X>K} = q K ~\ 
we derive from ()2.31|) for any a\ < „ , and X = ai log X 

p{tfi(GW(g,c)) <oilogiv}=o(l), 

which proves statement (jl.4j) . 

Consider now for any positive constant a and a function if = w(N) > logiV 

P |(7i (Gjsr(q, c) j > aw j = P < max Xi > aw > . (2.32) 

I, Vi&L ) 

We know already from ()2.14|) that in the subcritical case the size (the number of macro- 
vertices) of any L is whp o(N). Note that when the kernel k(x, y) is not bounded uniformly 
in both arguments, which is our case, it is not granted that the largest component in the sub- 
critical case is at most of order logiV (see, e.g., discussion of Theorem 3.1 in PQ). Therefore 
first we shall prove the following intermediate result. 



Lemma 2.2. If c < c cr (q) then 



P 



{c a (GV(X,g,c)) > X 1 / 2 } = o(l). (2.33) 



Proof. Let us fix e > and 5 > arbitrarily and introduce the following event 



Bn ■= A s N n I max X < — . logiV ) n ( nf_ x 
1 l<i<T(7V) I log g| ' 1 
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T(N) 



< ekfi(k) } . (2.34) 



According to (ETTSD . (12~221) and (12~T5]) we have 



P{B N } = l-o(l) (2.35) 

as iV — > oo. 

Recall that t n (x) denotes the set of the macro- vertices in the tree constructed according 
to the algorithm of revealing of connected component described above. Let |rjv(x)| denotes 
the number of macro- vert ices in tn(x). Then we easily derive 



{c 1 (G JV (X,g,c)) > iV 1 / 2 } < P {^max^ MX,)] > N 1 ' 2 \ B N } 



+ o(l) (2.36) 



N 

< N Y,(~L + zk)v{k)(5 + 1/EX^)P {\r N {k)\ >N 1 ' 2 \B N } + o(l) 



as N —>■ oo. We shall use the multi-type branching process introduced above (Section 2.2) 
to approximate the distribution of |rjy(A;)|. Let further X c,q (k) denote the number of the 
particles (including the initial one) in the branching process starting with a single particle 
of type k. Observe that at each step of the exploration algorithm, the number of new 
neighbours of x of type y has a binomial distribution Bin(N y ,p xy (N)) where N' y is the 
number of remaining vertices of type y, so that N' y < M y . 

We shall explore the following obvious relation between the Poisson and the binomial 
distributions. Let Y UiP e Bin(n,p) and Z a £ Po(a), where < p < 1/4 and a > 0. Then for 
all k > 

P{Y n , p = k}<(l + Cp 2 ) n P{Z n _L_ = k}, (2.37) 
where C is some positive constant (independent of n and p) . Notice that for all x,y < 

|logg| to 

/ c \ x v c 

p xy (N) = 1 - (l - — ) =-xy(l + o(l)), (2.38) 

and clearly, p xy (N) < 1/4 for all large N. Therefore for any fixed positive E\ we can choose 
small e and 5 in ()2.34j) so that conditionally on Bn we have 

% /_ x ~ N L < (i + vei)tiv)<z, v) (2-39) 

for all large N. Let us write further 

Kv) = ^q(y)> v<i = Yl y^i(y) (= EX )> «fo 2/) = « c , g (^, y) 

y>i 
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emphasizing dependence on q and c. Then for any e 2 > and any q' > q such that 
i^YZ^ < 1 + £2 we can choose S\ < log(<//g), and derive from (|2.39j) 

N v /_ X ~ N L < (1 + e 2 )pt q r(y)K c , q (x, y) = N (y) (1 + £a) ° xy. (2.40) 

-L p X y{l\ ) fXq 

Setting now c' := (1 + £2)^7- c we rewrite ()2.4()j) as follows 

^ l-^ir) - MvW*. v)- ( 2 - 41 ) 

Recall that above we fixed g and c < c cr (g), where c cr (g) is decreasing and continuous in q. 
Hence, we can choose q' > q and d := (1 + e 2 ) — c so that 

c<c' < c cr {q') < c cr (q), (2.42) 

and moreover c' and q' can be chosen arbitrarily close to c and q, respectively. 

Now conditionally on Bn we can replace according to ()2.37j) at each (of at most N) 
step of the exploration algorithm the Bin(Ny,p xy (N)) variable with Po{N' y ^^ N ^ ), and 
further replace the last variables with the stochastically larger ones Po(n q i(y)K c i tq r(x,y)) 
(recall (|2.41)0 . As a result we get the following bound using branching process: 

P{\T N (k)\>N x / 2 \B N } (2.43) 
<(l + c( max p xv (N)] ) P {x c '> q '{k) > N 1 ' 2 

- y Vx,j / <2iog7v/iio g(? i y J J I 

This together with (J2.38j) implies 

P {\r N (k)\ > iV 1/2 I B N } < e h ^ N)i P [X c '- g '(k) > iV 1 / 2 } , (2.44) 
where b is some positive constant. Substituting the last bound into (j2.36J) we derive 

p{c 1 (G JV (X,g,c)) >iV 1 / 2 } (2.45) 

N 

< b 2 Ne b(logN)4 kfiq,(k)P \x c '' q \k) > N 1/2 \ + o(l) 



k=l 
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as iV — > oo, where 62 is some positive constant. By the Markov's inequality 

P{X c '>«'(k) > N 1 ' 2 } < Z ^ 1/2 E / C '''W (2.46) 

for all z > 1. Denote /i,(jfe) = E** eV W; then with a help of we get from (l2~2Kj) 

v 

P{c 1 (G JV (X,g,c)) > A^ 1 / 2 } <6 1 iVe b(losiV)2 2- iVl/2 ^A;/i (? ,(A;)^(A;) + o(l). (2.47) 

fc=i 

Now we will show that there exists z > 1 such that the series 

00 

k=i 

converge. This together with (J2.47)) will clearly imply the statement of the lemma. 

Note that function h z (k) (as a generating function for a branching process) satisfies the 
following equation 

h z (k) = zexp{J27=i K c i t qi(k,x)fi q ,(x)(h z (x) - 1)} 

= Z eX P { Jj k (E^=l X N (%) h z(x) ~ fig') } 

= zexp^k(B z (cf,q') - fi q >)} . 
Multiplying both sides by kfi q i(k) and summing up over we find 

B z (c',q') = V ]kfi q >(k)zexp \ —k(B z (c',q') - fi q >) \ ■ 

Let us write for simplicity B z = B z (c,q). Hence, as long as B z is finite, it should satisfy 
equation 

B z = zEXe^ I(& - EI) , (2.48) 

which implies in turn that B z is finite for some z > 1 if and only if (j2.48|) has at least one 
solution (for the same value of z). Notice that 

B Z >B 1 = EX (2.49) 
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for z > 1. Let us fix 2 > 1 and consider equation 

y/z = EX e^ x{y - EX) =: F(y) (2.50) 
for y > EX. Using the properties of the distribution of X it is easy to derive that function 

1 e^" 1 ) 



F(y) 



EX (1 _ qe c(^-i)y 



is increasing and has positive second derivative if EX < y < yo, where yo is the root of 
1 = oexp {c(^x — 1)}. Compute now 

^F(y)\^ x = ^EX^A (2.51) 

Hence, if c < c cr then there exists z > 1 such that there is a finite solution y to (|2.5()|) . 
Taking into account condition 1)2.42)1 . we find that B z (c',q') is also finite for some z > 1, 
which finishes the proof of the lemma. □ 

Now we are ready to complete the proof of p. 3)1 . following almost the same arguments as 
in the proof of the previous lemma. Let Sn(x) = J2 V . &TN ( X ) Xj denote the number of vertices 
from Vat which compose the macro- vertices of rjv(rr). Denote 



B' N :=B N n( max \r N {Xi)\ < N 1 / 2 ) . 

\l<i<T(N) ) 



According to ()2.35jl and Lemma [2.21 we have 

P{B' N } = l-o(l). 

This allows us to derive from ()2.32j) 



P [Ci (G N (q, c)) > aw} < P j^max^ S N (Xi) > aw | B^j + o(l) 



(2.52) 



N 

< 

k=l 



Nj2( 1 + £k )v( k )( 6 + 1 / EX ) I> { S N(k) > aw I B^} + o(l). 



Let now S c ' q (y) denote the sum of types (including the one of the initial particle) in the 
total progeny of the introduced above branching process starting with initial particle of type 
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y. Repeating the same argument which led to (|2.43j) . we get the following bound using the 
introduced branching process: 

P {S N (k) > aw I B' N } < 1 + C ( max p xy (N) ) P \s c '> q '(k) > aw] 

\ \x,y<2io g N/\io gq \ J J { > 



as N — > oo, where we take into account that we can perform at most v N steps of exploration 
(the maximal possible number of macro- vertices in any L). This together with ()2.38j) implies 

P {r N (k) > aw | B' N } < (1 + o(l))P {s c '> q '{k) > aw} (2.53) 
as iV — > oo. Substituting the last bound into ()2.52|) we derive 

N 

P lCi(G N (q,cj) > aw} < NbJ2kv q >{k)P {S c '' q '{k) > aw} + o(l) (2.54) 



k=l 



as N — > oo, where 6 is some positive constant. Denote g 2 (fc) = Ez sc ,q then similar to 
(j23Zt we derive from (1231) 

P{Cifev(g,c)) >aw(iV)} < oiV k^{k)g z {k) Z - aw ^ + o(l). (2.55) 

k=l 

We shall search for all z > 1 for which the series 

oo 
k=l 

converge. Function g z (k) (as a generating function for a certain branching process) satisfies 
the following equation 

g z (k) = z k ex.p{J2™=iKc>,q'(k,x)iJ, ql (x)(g z (x) - 1)} 
= z k ex.p^k{J2^ =1 x^ q >(x)g z (x) -jy)} 

= z fc exp{^(^(c',</)-/v)}- 
Multiplying both sides by k/J, q i(k) and summing up over we find 

^(cW) = f>/y(A;)^exp (— ^(c', g') -/y)\ . (2.56) 
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It follows from here (and the fact that A z {d ', q') > fi q > for all z > 1) that if there exists z > 1 
for which the series A x (d, q') converge, it should satisfy 

z <-. (2.57) 
Q 

According to ()2.56j) . as long as A z = A z (c, q) is finite it satisfies the equation 

A z = EXz x e^ x ^- EX \ 

which implies that A z is finite for some z > 1 if and only if the last equation has at least 
one solution 

A z > At = EX. (2.58) 
Let us fix z > 1 and consider equation 

y = El/ e ^ x{y - EX) =: f(y, z). (2.59) 

Using the properties of the distribution of X it is easy to derive that 

1 ze^Ex -1 "' 



f(y,z) 



EX (i- qZ e^-^y 

We shall consider f(y,z) for y > EX and 1 < z < ie"^- 1 ). It is easy to check that in 
this area function f(y,z) is increasing, it has all the derivatives of the second order, and 
J^sf(y,z) > 0. Compute now 





EX EX ~ ^ 

Hence, if c > c cr there is no solution y > EX to ()2.59|) for any z > 1. On the other hand, if 
c < c cr then there exists 1 < Zq < 1/q such that for all 1 < z < Zq there is a finite solution 
y > EX to ([2.59)1 . One could find Zq for example, as the (unique!) value for which function 
y is tangent to f(y, z ) if y > EX. 

Now taking into account that d > c and q' > q can be chosen arbitrarily close to c and 
q, respectively, we derive from ()2.55j) that for all 1 < z < zq 

P {Ci (G N {q, c)) > aw{N) } < b{z)N Z - aw{N) + o(l) (2.61) 

as N — ► oo, where b(z) < oo. This implies that for any a > 1/ log^o > 1/| log g| 

pjc^G^c)) >alogiv} =o(l) (2.62) 
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as N — > oo, which proves (jl.3|) . 



□ 



To conclude this section we comment on the methods used here. It is shown in jH] that 
in the subcritical case of classical random graphs the same method of generating functions 
combined with the Markov inequality leads to a constant which is known to be the principal 
term for the asymptotics of the size of the largest component (scaled to log N) . This gives us 
hope that a constant a chosen here to satisfy a > 1/ log zq is close to the minimal constant 
for which statement (J2.62)) still holds. 

Similar methods were used in [7] for some class of inhomogeneous random graphs, and in 
for a general class of models. Note, however, some difference with the approach in pQ. It 
is assumed in pQ, Section 12, that the generating function for the corresponding branching 
process with the initial state k (e.g., our function g z {k), k > 1) is bounded uniformly in k. 
As we prove here this condition is not always necessary: we need only convergence of the 
series A z , while g z (k) is unbounded in k in our case. Furthermore, our approach allows one 
to construct constant a(q, c) as a function of the parameters of the model. 

2.5 Proof of Theorem 11.11 in the supercritical case. 

Let Cfc denote the set of vertices in the fc-th largest component in graph G]y(q,c), and 
conditionally on X let Ck denote the set of macro- vert ices in the k-th largest component 
in graph GV(X, q,c) (ordered in any way if there are ties). Let also C& and Ck denote 
correspondingly, their sizes. According to our construction for any connected component L 
in Gat(X, q, c) there is a unique component L in G^{q,c) such that they are composed of 
the same vertices from Vn, i.e., in the notations (|2.5|) 



Next we prove that with a high probability the largest components in both graphs consist 
of the same vertices. 

Lemma 2.3. For any < q < 1 if c > c cr (q) then 



L = U vel U kev {k} =: V(L). 



P{d = V{C X )} = 1 - o(l) 



(2.63) 



as N 



oo. 



Proof. In a view of the argument preceeding this lemma we have 



P{Ci ^ V(d)} = P{Ci = V{C k ) for some k > 2}. 
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According to Theorem 12.6 from [l.J, conditions of which are satisfied here, in the supercritical 
case conditionally on T(N) such that T{N)/N -»■ 1/EX, we have whp C 2 = 0(log(T(N))), 
which by Proposition 12.11 implies Ci = 0(logN) whp . Also we know already from (|2.14j) 
that in the supercritical case C% = O(N) whp, and therefore C\ = O(N) whp. Hence, for 
some positive constants a and b 

P{Ci ^ V(Ci)} = P{Ci = V(C k ) for some k > 2} (2.64) 

< P | (max \V(C k )\ > bN^j n fmaxC fc < alog i\A J + o(l). 

Define now for any K = K(N) a set 

B N := {3X { > K for some 1 < % < T(N)}. 

According to (12~2UD 

P(Bjv) < CWg* + o(l) 

as A" — > oo for some constant C independent of X and AT. Setting from now on K = \/~N 
we have P(B N ) = o(l) as A" — > oo. Then we derive 

P | Uax | F (Cfc) | > 6AM n fmax C k < a log A^ | (2.65) 
< P i ( max|F(C fc )| > bN ) n [max^ < alogAT ) n ( max X < v 7 ^ ) i + o(l) 

t V fc - 2 J \k>2 J \l<i<T(N) J J 

< P jv/iValogX > Wvj + 0(1) = o(l). 
Substituting this bound into (|2.64|) we immediately get (|2.63j) . □ 

Conditionally on C\ = V(C\) we have 

§ =^E2f ) ^i{^GC 1 } 

= f Efif ) Eti kl{X, = k}l{ Vl G Ci} (2.66) 

Note that Theorem 9.10 from pQ (together with Proposition 12. II in our case) implies that 

Mk) ■= ■fT^jMvi e Ci(JV) : X = A;} ^ fc)/x(fc) (2.67) 
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for each k > 1. 

We shall prove below that also 



N oo 
P 



W N : = kv N (k) £ kp(n; k)fi(k) =: (3 EX. (2.68) 



k=l k=l 



Observe that fc) is the maximal solution to ()2.12|) . therefore (3 is the maximal solution 
to 

oo oo 

P = ex ^ = EjSK 1_e " ^ " (fc,tfHw)p(,titf) ) A*(*) 

fe=i fe=i 

= l_^ E (Xe- c ^ 
EX V 

This proves that (3 is the maximal root of (jl.6j) . Then ()2.68|) together with Proposition 12. 11 
which states that T(N)/N 1/EX, will allow us to derive from ()2.66|) that for any positive 

e 



f Ci[G N (c,q)) _ . 

'{l^^V -i3\>e\C 1 = V{C X )} -> 



as X — > oo. This combined with Lemma 12. HI would immediately imply 



X 

and hence the statement of the theorem follows. 



- P, (2.69) 



Now we are left with proving ()2.68j) . For any 1 < R < X write Wn '■= W§ + w§, where 

R N 



k=l k=R+l 

By ()2.67|) we have for any fixed R > 1 

R 



W n ^ ^2kp(K;k)n(k) (2.70) 



rR p 

k=l 

as X — > oo. Consider w§. Note that for any k > 1 

T(iV) 

Ei/*(*) = Ee{^(A;) I T(X)j < E— £ P{X, = fc | T(X)}. (2.71) 
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Using events A$,n with bound (J2.18)) and Proposition 12. II we obtain from (|2.71|) for any fixed 
< 5 < 1/(2EX) 

, T(N) 

Ev N (k) < E— P ^ = k I T ( N )WAn} + P{A^} 

< j^^x) p{Xl = m + o(1)) + p{ ^ } - 

Bound (|2.18|) allows us to derive from here that 

Eis N (k) < A x {p{k) + e- aiN ) (2.72) 
for some positive constants A\ and a\ independent of k and N. This yields 

N 

Ew% = kE "N{k) < A 2 e~ a2R (2.73) 

k=R+l 

for some positive constants A 2 and a 2 . 

Now for any e > we can choose R so that 

oo 

kp(K;k)/i(k)<£/3, 

k=R+l 

and then we have 

oo 

P{\W N - Ykp(K] k)fi(k)\ > e} (2.74) 
k=i 

R oo 

= P{\(W§-^2kp(K;k)fi{k))+u4- Yl kp{K-k)ii(k)\>e} 

k=l k=R+l 
R 

< P{\W* - k P(K] k)p(k)\ > e/3} + P{?4 > e/3}. 

k=l 

Markov's inequality together with bound (|2.73|) gives us 

f p , , 3Ew% 3A 2 e~ a2R 
P{w§ > s/3} < ^ < — . (2.75) 
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Making use of (|2.75|) and (J2.7U)) we immediately derive from (|2.74|) 

°° 1Anp- a2R 
P{\W N -^2kp(K',k) t M(k)\>e}<o(l) + — (2.76) 

fe=i 

as N — > oo. Hence, for any given positive e and Eq we can choose finite R so large that 

oo 

lim *P{\W N - V kp(K] k)fJL(k)\ > e} < e . (2.77) 

fe=l 

This clearly proves statement (|2.68|) . and therefore finishes the proof of the theorem. □ 
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